A Novel Approach for Outlier Detection using Rough Entropy
نویسندگان
چکیده
Outlier detection is an important task in data mining and its applications. It is defined as a data point which is very much different from the rest of the data based on some measures. Such a data often contains useful information on abnormal behavior of the system described by patterns. In this paper, a novel method for outlier detection is proposed among inconsistent dataset. This method exploits the framework of rough set theory. The rough set is defined as a pair of lower approximation and upper approximation. The difference between upper and lower approximation is defined as boundary. Some of the objects in the boundary region have more possibility of becoming outlier than objects in lower approximations. Hence, it is established that the rough entropy measure as a uniform framework to understand and implement outlier detection separately on class wise consistent (lower) and inconsistent (boundary) objects. An example shows that the Novel Rough Entropy Outlier Detection (NREOD) algorithm is effective and suitable for evaluating the outliers. Further, experimental studies show that NREOD based technique outperformed, compared with the existing techniques. Key-Words: Data Mining, Outlier, Rough Set, Classification, Pattern recognition
منابع مشابه
Rough K-means Outlier Factor Based on Entropy Computation
Many studies of outlier detection have been developed based on the cluster-based outlier detection approach, since it does not need any prior knowledge of the dataset. However, the previous studies only regard the outlier factor computation with respect to a single point or a small cluster, which reflects its deviates from a common cluster. Furthermore, all objects within outlier cluster are as...
متن کاملOutlier Detection in Wireless Sensor Networks Using Distributed Principal Component Analysis
Detecting anomalies is an important challenge for intrusion detection and fault diagnosis in wireless sensor networks (WSNs). To address the problem of outlier detection in wireless sensor networks, in this paper we present a PCA-based centralized approach and a DPCA-based distributed energy-efficient approach for detecting outliers in sensed data in a WSN. The outliers in sensed data can be ca...
متن کاملRODHA: Robust Outlier Detection using Hybrid Approach
The task of outlier detection is to find the small groups of data objects that are exceptional to the inherent behavior of the rest of the data. Detection of such outliers is fundamental to a variety of database and analytic tasks such as fraud detection and customer migration. There are several approaches[10] of outlier detection employed in many study areas amongst which distance based and de...
متن کاملSome issues about outlier detection in rough set theory
‘‘One person’s noise is another person’s signal” (Knorr, E., Ng, R. (1998). Algorithms for mining distancebased outliers in large datasets. In Proceedings of the 24th VLDB conference, New York (pp. 392–403)). In recent years, much attention has been given to the problem of outlier detection, whose aim is to detect outliers – objects which behave in an unexpected way or have abnormal properties....
متن کاملA Rough Set Approach to Spatio-temporal Outlier Detection
Detecting outliers which are grossly different from or inconsistent with the remaining spatio-temporal dataset is a major challenge in real-world knowledge discovery and data mining applications. In this paper, we deal with the outlier detection problem in spatio-temporal data and we describe a rough set approach that finds the top outliers in an unlabeled spatio-temporal dataset. The proposed ...
متن کامل